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DATA COMMUNICATIONS 

This invention relates to data communications, and in particular to die 
communication of multimedia documents from multimedia servers to 
5 multimedia clients. 

The Internet currently provides users with a number of different 
services. One of such services is the Worldwide Web, or "the Web", which has 
now become relatively well-known and well-used by fixed terminal subscribers. 
To access the Web, a user instals a "browser" application on a terminal, 
1 o which acts as a client interfacing with a network of distributed Web servers via 
the Internet 

In order to access a page on the Web, a user specifies a universal 
resource locator (URLX and transmits a page request via a data link to the 
Internet In the Internet, the request is routed to the appropriate Web server as 

15 identified in the URL. In return, the Web server transmits the pages back to the 
requesting browser. The communications protocol used within the Internet is 
TC/IP during this dialogue. 

Web pages are currently formatted using the HyperText markup 
language (HTML). A Web browser, on receipt of a HTML document, 

20 interprets the HTML commands in order to format the text correctly. 

HyperText links may be included in the document which, when presented to the 
user, may be selected in order to request a further Web page (a HTML link 
references a further URL with which the browser retrieves the further page). 
Web pages currently contain not only text and hyperlinks, but also image , audio 

25 and/or video files (hence, Web pages are referred to as "multimedia" 
documents). 

Although the textual data in the Web page is generally relatively data- 
compact, it is recognised that graphics files, audio files, and in particular video 
files, contain relatively large amounts of data which can reduce related 
30 download of Web pages considerably. As a result the downloading of "true" 
multimedia documents including graphics, audio and/or video files can be 
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cumbersome, in particular where the data link used into the Internet is of a type 
which is of a relatively low data rate. 

The speed of download of multimedia Web pages is particularly 
problematic in the case of mobile data links such as a cellular radio data link, in 
5 which the radio data transfer is constrained by the relatively low bandwidth of 
the radio interface link. 

Therefore, although it is currently possible to access the Web at a mobile 
communications terminal, the option is currently not widely taken advantage of 
and, when taking advantage o£ die data transferred is generally limited to 
10 textual data to achieve reasonable download speeds. 

Methods are known whereby the amount of data transfer involved in 
accessing the Web are reduced. In particular, many browsers support a caching 
facility whereby a Web page, once downloaded, is temporarily stored locally on 
the client terminal. When the Web page is next requested, the browser 
15 recognises, by the URL in the request corresponding with the URL of the 
cached document, that the document is currently held locally, and retrieves the 
locally-held page in preference to re-downloading the page from the Web. The 
browser may transmit an update check to the Web server in question in order to 
confirm that the locally-stored Web page remains valid, and if not, downloads 
20 the updated Web page. 

Caching may be performed on a session-by-session basis, or may be 
performed such that a cached Web page remains stored between sessions. 
However, a browser maintains only a limited non-volatile cache of Web pages, 
and newly-cached Web pages are stored in preference to previously-stored Web 
2 5 pages (which are then deleted) when the cache becomes full. 

In accordance with one aspect of the present invention there is provided 
a multimedia client terminal, said terminal comprising: 

a browser for interpreting a multimedia document received from a 
remote server, said interpreting means comprising: 
30 means for recognising textual presentation markup tags in said 

document and presenting text to a user in accordance with said markup tags; 
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means for recognising a standard set of document-independent local 
library file markup tags in said document; 

means for storing a set of non-textual local library files corresponding to 
said local library tags; and 

means for presenting the contents of one of said local library files to a 
user in response to the recognition of one of said local library file tags in said 
multimedia document 

Rather than needing to download non-textual files accompanying the 
text of a document, the client terminal is able to receive one or more of a known 
set of document-independent and data-efficient markup tags and access locally- 
stored files which correspond to the tags in the downloaded document 

The arrangement of the present invention is to be contrasted with the 
arrangement in which the client terminal holds a non-volatile cache (although, 
such a cache may be used in combination with the present invention), in that 
caching mechanisms do not utilise document-independent local library markup 
tags. Tngtra* a document is cached by selection of a URL, and it is unlikely 
that, when a new multimedia document is downloaded, any parts of the 
document will already be held locally. 

The use of a non-volatile cache is non-systematic, and does not allow a 
server content developer to design multimedia documents specifically for low 
data-rate communication links, which is possible with the present invention. 

In addition, a non-volatile cache arrangement does not allow a user of a 
low data-rate receiving terminal to be confident that a new multimedia 
document can be downloaded at an acceptable speed, which is possible with the 
present invention. 

By defining a standard set of document-independent local library file 
tags, the present invention allows a server content developer to include one or 
more of such tags within a document to be placed on a server in the knowledge 
that a user, provided with a client application for recognising the predefined set 
of local library tags and for retrieving appropriate locally-stored files, will be 



able to download the document relatively quickly and receive each of the 
intended multimedia parts as intended. 

The present invention is of particular utility in relation to mobile 
communications terminals. 

Further aspects of the invention are set out in the appended claims. 

Further features of embodiments of the invention are as follows: 

1. The system employs tags to locally stored graphical images, 
drawing primitives, pre-drawn images, image manipulators, audio and video 
files. 

2. A first set of tags provides basic two dimensional drawing 
functionality, enabling the creation of a range of simple but effective composite 
images by means of combinations of drawing primitives. 

3. The client functionality supports a standard predefined range of 
pre-drawn images that can add further graphical content to a page, over and 
above that achieved using the drawing primitives. This enables an image to be 
retrieved from the local repository to be displayed at the specified co-ordinates. 
A scaling factor may be specified to dynamically resize the image and a 
hyperlink can be optionally provided. 

4. The client functionality supports the manipulation of on-screen 
bitmap images. 

5. A second set of tags provides basic audio playback capability. 
The system employs tags to stored speech files as well as the ability to interpret 
"talking Web pages" through text to speech translation software. 

6. The client functionality supports a standard predefined range of 
audio files enabling a file to be retrieved from the local repository, and spoken 
by a speech agent at the client 

7. The speech agent supports default vocal and fecial styles in order 
to allow different genders, accents and languages into the playback. Different 
fecial appearances may be selected for the agent The combination of attributes 
may be used to create agents with different personalities. 
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8. A third set of tags provides basic video clip playback capability. 
The system employs tags to stored video files. 

9. The client functionality supports a standard predefined range of 
video files enabling a file to be retrieved from the local repository and replayed 

5 at the client 

10. In one embodiment, tags implementing the new functionality are 
embedded into Web pages, which may also contain HTML, and content 
handlers filter out the tags implementing the new functionality before resolving 
them. 

10 11. In another embodiment, references to files implementing the 

new functionality are embedded into Web pages, which may also contain 
HTML, as markups (e.g. for graphics resources <imgsrc="imagesAexLhgml r *>; 
for audio resources <spesrc="audio/testhsml>; and for video resources 
<vidsrc= "video/testhvml">). 
15 12. In a further embodiment, pages implementing the new 

functionality are a separate resource, which may be hyperlinked in the 
corresponding Web pages containing HTML (eg. 
htqp^/www.testcom/texthgml). 

13. The drawing functionality provided by tags to graphics 
2 0 primitives supports Web page navigation by means of hyperlinks. 

Embodiments of the invention will now be described, by way of 
example only, with reference to the accompanying drawings, wherein: 

Figure 1 shows a schematic block diagram illustrating a document 
retrieval system in accordance with an embodiment of the invention; 
25 Figure 2 illustrates a client/server functionality in accordance with an 

embodiment of the invention; 

Figure 3 illustrates a predefined set of document-independent graphics 
tags and a set of corresponding graphics elements; 

Figure 4 illustrates an alternative set of graphics elements to be used 
30 with the predefined set of graphics tags illustrated in Figure 3; and 



WO 99/05613 



6 



PCT/GB98/U2177 



Figure 5 illustrates a screen display provided in accordance with an 
embodiment of the invention. 

Referring to Figure 1, a multimedia document retrieval system in 
accordance with an embodiment of the invention includes a mobile client 
5 terminal 2 communicating via a mobile communications network 4, such as a 
GSM digital mobile communications network, and the Internet 6 with the Web 
server 7. 

The mobile client terminal 2 includes a user interface 8, a client 
application 10, a radio communications front end 12 and a radio transceiving 
10 antenna 14. 

The user interface 8 includes a keypad, a display screen and a 
loudspeaker for accepting user input and outputting images and audio signals 
respectively. 

The client application 10 is a customised Web browser, such as a 
15 customised version of the Netscape Explorer (Registered Trademark) browser, 
ami includes standard content handlers such as a text content handler 16 and an 
image content handler 1 8. 

In addition, the client application 10 includes, as provided in accordance 
with the present invention, a graphics tag handler 20, an audio tag handler 22 
20 and a video tag handler 24. These content handlers may be components of the 
browser 10 produced in a development language such as Java, or JavaBeans 
(Registered Trademark). Associated with each of the tag handlers are a standard 
graphics library 26, a standard audio library 28 and a standard video library 30, 
containing content files which are document-independent The graphics library 
25 consists of graphics files GF 1-X. The audio library consists of a number of 
audio files AF 1-Y, and the video library consists of a number of standard video 
files VF1-Z. 

The radio communications front end 12 includes amplifiers, frequency 
selective circuits, etc which are used to perform radio communications via the 
30 radio transceiving antenna 1 4. 
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The mobile client terminal 2 may be one of various types of terminal, 
for example a laptop computer provided with radio communications facilities, a 
personal digital assistant (PDA), or a mobile handset provided with Web 
browsing capabilities. 
5 Radio signals transmitted and received by the radio transceiving antenna 

14 of the mobile client terminal 2 are received and transmitted by a cell site 
antenna 32 of the mobile communications network 4, via a radio link 33. Only 
one ceil site antenna is illustrated but it will be appreciated that a large number 
of such antf!f"»flg are provided in the mobile communications network 4, and die 
10 particular serving cell site antenna will depend on the location of the mobile 
client terminal 2 at any particular time. The cell-site antenna interfaces, via a 
base station 34, to a mobile switching exchange 36 of the mobile 
communications network, which maintains a circuit connection 38 with an 
Internet gateway 40. 

15 The Internet gateway in turn interfaces via the Internet 6 to the Web 

server 7, using a TCP/IP connection 42. 

The server 7 holds a set of Web pages 44, each being referenced by 
different URLs. 

The contents of one such Web page 46 are illustrated in Figure 1. The 
20 Web page 46 contains a textual portion 48, to be handled by the text content 
handler 1 6, an image file portion 50, to be handled by the image content handler 
18, one or more of a standard predefined set of document-independentgraphics 
tags 52, to be handled by graphics tag handler 20, one or more standard 
predefined document-independent audio tags 54, to be handled by audio tag 
25 handler 22, and one or more of a standard predefined set of document- 
independent video tags 56. Others of the Web pages 44 also contain text 
content and the graphics tags, audio tags and video tags used in the illustrated 
document 46 may be repeated in the remaining Web pages of the set 44 held on 
the server. 

3 o When the client terminal 2 transmits the appropriate URL request to the 

server 7, via the various data links indicated the server 7 transmits the Web 
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page back, in HTML format, to the mobile client terminal 2. At the client 
terminal 2, the various content handlers 1 6-24 interpret the HTML to present the 
text and image content 48, 50 of the Web page as indicated by the HTML 
command therein, and the tag handlers recognise the particular graphics, audio 

5 and/or video tags referenced in the document 46, and present the contents of the 
corresponding file in the libraries 26, 28, 30 as indicated by commands 
associated with the respective tags. 

Figure 2 illustrates functionality of a client/server at the content handler 
level, in accordance with one embodiment of the invention. In this 

10 embodiment, three different "markup-languages" are provided for developing 
Web pages on the server ate, and which are interpreted by the tag handlers 20, 
22, 24 of the system shown in Figure 1. These languages are referred to herein 
.as the hyper-graphic mark-up language (HGML), the hyper-speech mark-up 
language (HSML) and hyper-video mark-up language (HVML). 

15 Designed to extend the power of the Web to mobile communications 

applications, HGML, HSML and HVML provide for fast communications over 
narrowband radio channels, such as GSM, while also being functional on higher 
bandwidth systems. 

Through the use of graphical files, audio clips and/or video clips 

20 provided on the mobile client terminal in a HGML/HSML/HVML Toolkit, the 
need for network programming may be significantly reduced, and it is possible 
to embed tags referencing advanced graphical, speech, video and animation 
which is stored on the client side into Web pages. 

The content handler interprets the HGML/HSML/HVML commands 

25 into graphics primitives and calls to access stored graphical images and 
speech/video files. It is multi-threaded to allow multiple paints onto the page at 
a given time. 

HGML, HSML and/or HVML commands may be embedded in a Web 
page containing HTML, in which case the client side content handlers process 
30 the page and filter out the HGML, HSML and/or HVML commands before 
resolving them. 
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References to HGML, HSML and/or HVML resource files may be 
embedded into Web pages, containing HTML, as markups (e.g. <imgsre- 
" images/testhgml^). 

Alternatively, HGML/HSML/HVML pages may be hyperlinked as 
5 separate resources in Web pages containing HTML (eg. 
http^Avww.testcom/testhgml). 

Figure 2 illustrates the case in which the HGML/HSML/HVML 
resources are embedded as markups. Figure 2 details how content handlers (as 
exemplified by a HGML content handler) manage content of variable type from 
10 a server to display it on a Web page. 

The HGML content handler interprets the HGML commands into either 
graphics primitive or calls to access stored graphical images. There are a 
number of areas of functionality (winch may be defined in object classes) on fee 
client side: 

15 ■ Content handling: The upper object acting as a container for 

the other required functionalities; 

■ Connection: To handle the network connection to the server 
and the retrieval of HGML resources from the server; 

■ Command interpreter/Text processing: Takes the HGML 
20 resource and processes it into the various graphics requirements (i.e. if more 

than one picture resides in the resource, etc); and 

■ Painting: Takes the commands that make a specific HGML 
graphic and draw the required graphics to the appropriate place in the browser 
graphical user interface. Again, this could be primitives or stored images. A 

2 5 Paint object extends a thread class and is managed by the command interpreter, 

allowing multiple paints to be performed at the same time. 
HGML 

HGML provides for 

1. Construction of graphics from a standard set of graphics 

3 o primitives (circles, lines, etc). These are described below. 
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2. Recall and placement of a standard set of logo/clipart graphics 
from a client based store. Examples are illustrated in Figures 3 and 4. 

3. Prevention of download of photo-realistic images. 

4. Yielded graphics which act as hyperlinks. 

> HGML includes various tags or commands, which can be grouped into 



three functional categories, as shown below. 
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Drawing Primitives 

The first group of document-independent tags provide HGML with 
10 basic drawing functionality, enabling the creation of a range of simple but 
effective composite images. 

Most commands include optional attributes for (out)line colour, style, 
thickness and, when appropriate, fill colour. If any of those that are not 
explicitly specified, the current default settings (as specified by the SETSTYLE 
15 tag) are used. 
ARC 
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Draws an arc of a specified width, height and angle from absolute 
starting coordinates. It has the following attributes: 

origin x, y coordinates of arc (absolute) followed by width, height, start 
angle and arc angle, line colour, line style, and line thickness. 
5 e.g. <arccoords="20J0J0J0J5M"color=red> 

ARROW 

Draws a line from xl,yl to x2,y2 using absolute screen (window) 
coordinates, with an arrow head at the x2,y2 position. It has the following 
attributes: 

10 start and end x,y coordinates (absolute), line color, line style, and line 

thickness. 

e.g. <arrow coords="10J0J00, 100" color =black psize="2*> 
ELLIPSE 

Draws an ellipse of a given width and height from the specified 
15 coordinates. This tag is also used to draw circles. The following attributes are 
provided: 

centre x,y coordinates (absolute) followed by width and height, outline 
colour, fill colour, outline style, and outline thickness. 

e.g. <ellipse coords^ "200, 200, 1 00, 50", color=blacK fill=yellow> 

20 LINE 

Draws a line from xl,yl to x2,y2 using absolute screen (window) 
coordinates. The attributes are: 

start andendx,y coordinates (absolute), line colour, line style, and line 
thickness. 

25 e* <limcoords="10J0J00JM",style="dotted".pste 

UNETO 

Draws a line from die current graphics cursor position to a new position 
(using absolute cooidinates). The following attributes are defined: 

end of line x,y coordinates (absolute), line colour, line style, and line 
30 thickness. 

e.g. <lineio coords="200. 150", color=green> 
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LINEREL 

Draws a line relative to the current graphics cursor position using 
coordinate offsets. The following are its attributes: 

end of line coordinates (relative), line colour, line style, and line 
thickness. 

e.g. <linere!coords= n 15,-10",psize="4"> 

LINK 

Allows a rectangular portion of the screen to be defined as an hyperlink. 
The following attributes arc given: 

rectangular region xl ,yl,x2,y2 (absolute), hyperlink reference. 

e.g. <tink coords="15J0,50 t 60", hrej^"http://www.demoxom/ 
demo.hgm"> 
ORIGIN 

Sets the position of the "graphics cursor*. For use with LINETO and 
LINEREL tags. New graphics cursor position x,y is attributed. 

e.g. <origin coords="100, 130"> 
POLYGON 

Draws a polygon using a list of absolute screen coordinates, starting 
with an origin. Coordinates are listed in the format w xl,yl,x2,y2,x3,y3 ... 
xn,yn". The line from the last specified coordinates back to the origin of the 
shape is drawn automatically to complete the polygon. The following attributes 
are provided: 

origin of shape, followed by n other points (all absolute coordinates), 
outline colour, fill colour, outline style, and outline thickness. 
e.g. <-six-sided-shape-> 

<polygon coonb=V00J00J20M140M160JWW120 t 

120,120", color=red,fill=red> 
RECTANGLE 

Draws a rectangle of a given width and height from the specified 
coordinates. This tag is also used to draw squares. The following attributes are 
defined: 
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top left x,y coordinates (absolute) followed by width and height 
dimensions, outline colour, fill colour, outline style, and outline thickness. 

e.g. <rectangfecoords=V0J0J00J5",style="dotted"> 
SETSTYLE 

5 Enables default settings for outline colour, fill colour and outlines style 

and thickness to be specified. These settings are then used unless explicitly 
overridden by the attributes in other HGML commands. The attributes are: 

outline colour, fill colour, outline style, outline thickness. 

e.g. <setstyle color=red, fill=greeri> 

10 TEXT 

Displays a text string, starting from the specified coordinates. It has the 
following attributes: 

text position x,y coordinates (ahsolute), text string to display, and text 

colour. 

15 e.g. <text coords="10>50" text="This is an example of HGML 

text"> 

STORED IMAGES 

A standard range of pre-drawn images are stored on the mobile client 
terminal, associated with a standard document-independent set o f graphics tags 
20 as exemplified in Fig. 3. These images may be used to provide further graphical 
content to a page, in addition to that which may be achieved using the drawing 
primitives. 
IMAGE 

Enables an image to be retrieved from the local repository and displayed 
25 at the specified coordinates. The current theme will be used unless another one 
is explicitly specified. A scaling factor may be specified to dynamically resize 
the image and a hyperlink can be optionally provided. The attributes are: 

image name, theme style, top left coordinates of image, scaling factor 
(percent), hyperlink reference. 
30 e.g. <image name = ARROW L coords="50 9 50", scaled 50, 

href= n http:/Avww. demo, com/ demo, hgm "> 
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THEME 

Enables a new current image theme to be selected. The attributes are: 

name of image theme, 

e.g <theme=DEFA ULT> 
5 An example of a set of images which may be associated with an 

alternative theme (Theme 2"). As shown, different themes may not have 
defined images for all of the predefined set of tags - some may not be used in 
associated with a particular theme. Only a set number of theme styles are 
provided for. 

10 At a high level, themes are split into three general categories: 

■ Navigation 

Images of buttons for a range of relatively standard Web page functions. 
The structure of this category is strictly defined by HGML such that images are 
referenced by standard names and relate to the same type of image irrespective 
15 of the theme being used (Le. HOME, SEARCH, ARROW etc). 

■ Design 

A range of standard page design constructs in terms of lines, bullets, 
separators, etc. The structure of this category is strictly defined by HGML such 
that images are referenced by standard names and relate to the same type of 
20 image irrespective of the theme being used (i.e. LINEK LINE2, BULLET1 etc). 

The navigation and design categories have a prescribed content (i.e. die 
first element in the Design category is always a line etc). As such, changing die 
theme associated with a page will mean that, whilst it looks different, the 
images displayed will still be meaningful and appropriate (e.g. a line will still 
2 5 appear where intended albeit in a different style). 

■ User 

User-defined images of which die content is specific to the thane. 
HGML defines standard names (Le. USER1, USER2 etc), but these do not 
provide any indication of die type of image being referenced. 
30 The client side HGML functionality provides at least a default theme. 

Other themes (defined for example by individual site designers) may be 
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downloaded from a remote server the first time a service that uses them is 
accessed. Alternatively, a user may have a personal preferred theme which may 
be provided on the client terminal and used in preference to the default and/or 
downloaded theme. 

5 User-defined theme elements are provided to permit a degree of 

flexibility to Web site developers, whilst still remaining optimisation 
advantages* 

This group allows local storage of other images, outside the standard 
categories prescribed. These may be used to reference any images that the 
10 site/theme designer wishes. It is anticipated that this will include images 
relating to corporate identity (e.g. logos etc) to further standardise the look and 
feel of their pages. They could, however, also include additional line and bullet 
styles etc that could not be encompassed under the standard headers provided. 
The number of user-defined images is limited to a predetermined 
15 number (e.g. ten) in order to prevent the image repository from becoming 
undesirably large. 

All images not included in the standard theme may need to be 
downloaded each time a site is accessed (although they may be cached from a 
previous session). At such, it is intended that user-defined slots are utilised to 
20 hold the largest and most used images (thus helping to further optimise 
download times). 

Whilst user-defined images are envisaged primarily as site-specific, 
however other designers may wish to reuse them in their own sites. In this case, 
there are two alternatives: 
25 (1) to incorporate the required images into the user-defined slots of 

the site* s own theme; or 

(2) to make the site reliant on two or more themes (i.e. a default, 
along with the one(s) containing the images to be reused). In this case, the end- 
user obtains all of the necessary themes before being able to access the site. The 
3 o site provider would provide download access for each theme. 
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Tma op Manipulation 

HGML supports the manipulation of the onscreen bitmap image using 
COPY, PASTE, FLIP ami ROTATE commands. 

These are intended to further optimise the creation of certain types of 
5 image. For example, if a composite image created using several drawing 
primitives is also required at another onscreen location, then it may be copied 
and pasted rather than redrawn. The orientation of it may then be changed using 
the FLIP and ROTATE operations. 

The functions also help to optimise the storage of theme images. For 
10 example, rather than store four Arrow symbols (pointing left, right, up and 
down), a single image may be stored in the database and then ROTATEd and 
FLIPed as desired. 
COPY 

Copies of specified rectangular screen segment to a memory buffer. The 
15 original image remains unchanged The attributes are: 
top left and bottom right coordinates of segment. 
e.g. <copycoorcis=' , 50,50J00 t 100"> 

FLIP 

Performs a horizontal or vertical flip of a screen defined by the specified 
20 coordinates. The attributes are: 

top left and bottom right coordinates of segment and horizontal or 
vertical flip. 

e.g. <flip coords="50.50J00, 100",axis="horiz"> 
PASTE v 
25 Copies of the contents of a memory buffer to an appropriately sized 

rectangular screen segment, starting at the specified coordinates. The attributes 
are: 

top left coordinates to paste segment 
e^. <pastecoords="100J00'*> 
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ROTATE 

Performs a rotation of a circular screen area by a specified angle. The 
attributes are: 

coordinates of centre, radius and rotation angle. 
e.g. <rotate coords="50,50,30",artgle="80"> 
HSML 

Figure 5 illustrates a screen display on the mobile client terminal 2 when 
displaying a Web page containing HSML. 

As illustrated, the browser displays the text content, formatted as 
specified by the HTML markups in the Web page, and a speech agent image 
portion 102, which is driven in accordance with the HSML markups contained 
or referenced in the downloaded Web page in association with the replay of 
locally-stored audio clips referenced by audio tags contained in the Web page, 
or the generation of speech by conversion from text contained in the Web page. 

The speech agent interface provided by the mobile client terminal is 
controlled by a speech playback/generatorprogramme application and a moving 
image application programme stored on the mobile terminal. 

The speech agent has a variety of selectable vocal and facial styles. The 
various selectable vocal styles are defined in vocal style files stored on the 
mobile client terminal, containing data describing algorithms and settings for 
manipulating the audio data providing the speech function to alter the vocal 
characteristics. The facial style selections are defined in fecial style files stored 
on the mobile client terminal. The fecial style files include data defining a 
plurality of selectable base facial images, a plurality of selectable mouth images, 
a plurality of image animation algorithms (e.g. talk, smile, wink, etc.) 
corresponding with the base facial images and the mouth images, and a plurality 
of image rendering algorithms for rendering the fecial style in accordance with 
selected characteristics of the face, such as skin colour, hair colour, etc. The 
vocal and fecial style files may be stored permanently on the mobile terminal 
and/or stored in non-volatile memory on the mobile terminal after being 
downloaded from an Internet server. 



WO 99/05613 



18 



PCT/GB98/02177 



The speech agent function provided in the mobile client terminal is 
provided with predefined default settings, in which the vocal and fecial style 
characteristics are preset without input from the user. These characteristics 
include voice gender, accent, language, base facial style, skin colour, hair 
5 colour, eye colour, mouth size, etc. The user may however override the default 
vocal and facial style characteristics settings, by resetting one or more of the 
characteristics via menu options on a man-machine interface of the mobile 
terminal, in order to provide a personalised default speech agent. The default 
speech agent settings, whether predefined or user-defined, are used unless an 
10 alternative agent theme is specified by mark-ups contained in a Web page, as 
described below. If an alternative agent theme is specified, the vocal style and 
the fecial style of the speech agent is altered only for the duration of the display 
of the Web page in question, after which time the default speech agent is 
reverted to. 

15 HSML includes the following markups: 



Audio Playback 


Agent Style 


Agent Control 


SAY 


AGENT THEME 


ACTION 



SAY 

This markup causes an audio file to be retrieved from, or generated 
using data retrieved from, the local repository and "spoken" by a speech agent 
20 The default speech agent theme, defining the vocal and facial styles of the 
speech agent, will be used unless another one is explicitly specified. The 
attribute required is a phrase name: 

eg. <sqy phare^e= f9 GREETING! n > 

"GREETING!" may be a tag to an audio file held in the audio graphics 
25 library 28, or may be replayed using a text-to-speech translator on the client 
side. 
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AGENT THEME 

This markup causes the vocal and facial style combination for the 
speech agent to be altered from the default Voice style attributes may be used 
in order to incorporate different genders, accents and/or languages into the 
5 playback. Face style attributes allow different appearances to be selected for the 
speech agent The combination of attributes may be used to create agents with 
different personalities. The attributes which may be included are: 

voice gender, accent, language, base fecial style, skin colour, hair 
colour, eye colour, mouth size, etc. 
10 e.g. <agent theme^emale3>EnglishEnglishfemcde5,paleMack 

bwym y medxum> 
ACTION 

This markup controls the image of the speech agent, determining 
whether it is to be displayed and any movements to be made in conjunction with 
15 audio playback. The attributes are an action for the agent (possible actions: 
appear, talk, smile, wink, etc.) and an action activation status (on/off)- e.g. 

<action=tcdk,ori> 

The action markups are referenced by a moving image handler to 
produce and control die image of the speech agent When the appear action is 

20 activated, the speech agent appears at a specified part of the screen display. 

When deactivated, the speech agent does not appear. When the talk action is 
activated, the moving image handler animates the mouth of the speech agent in 
conjunction with any speech being generated by the SAY command. Other 
actions (e.g. smile, wink) are also suitably animated when activated. 

25 HVML 

HVML supports a single tag, namely PLAYBACK, which has a file 
reference attribute. When parsing Web page, the HVML tag is interpreted by 
the video tag handler, which retrieves a video file from the standard video file 
library 30 which corresponds with the reference contained in the command. 

30 It will be appreciated that the HGML, HSML and HVML functionality 

illustrated above are provided for exemplary purposes, and are not intended to 
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be limiting. It will be appreciated that various modifications and variations may 
be employed without departing from the scope of the present invention. 
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CLAIMS: 

1 . A multimedia client terminal, said terminal comprising: 

a browser for interpreting a multimedia document received from a 
5 remote server, said interpreting means comprising: 

means for recognising textual presentation markup tags in said 
document and presenting text to a user in accordance with said markup tags; 

means for recognising a standard set of document-independent local 
library file markup tags in said document; 
10 means for storing a set of non-textual local library files corresponding to 

said local library tags; and 

means for presenting the contents of one of said local library files to a 
user in response to the recognition of one of said local library file tags in said 
multimedia document 

15 

2. A multimedia document server, said server comprising storage 
means holding a set of multimedia documents, one or more of said documents 
comprising: 

textual presentation commands; and 
20 one or more of a standard set of document-independent markup tags to 

be recognised by a client terminal downloading said one or more documents as 
references to non-textual files which are stored in said client terminal. 

3* A document retrieval system comprising a client terminal in 
25 accordance with claim 1 and a document server in accordance with claim 2. 



4, Apparatus according to any preceding claim, wherein said client 
terminal is a mobile commimicationstenninaL 
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5. Apparatus according to any preceding claim, wherein said non- 
textual files comprise one or more of graphics files, audio files and/or video 
files. 

5 6. Apparatus according to any preceding claim, wherein said non- 

textual files comprise data for speech synthesis, and said tags define speech to 
be synthesized 

7. A communications terminal for accessing Web pages via an 
10 Internet protocol connection, said terminal comprising a browser for interpreting 
Web pages and presenting the contents thereof on a display portion of the 
communications terminal, said browser comprising means for recognising two 
dimensional graphics markup tags in a Web page and for rendering a graphical 
object on said display portion on the basis of a plurality of said graphics markup 
15 tags. 

g. A communications terminal according to claim 7, wherein said 
graphics markup tags comprise line drawing commands and/or image 
manipulation commands. 

20 

9. A Web server comprising Web pages describing a two 
dimensional graphical object by means of the graphics markup tags used by the 
mobile communications terminal of claim 7 or 8. 

25 10. A multimedia client terminal, said terminal comprising: 

a browser for interpreting a multimedia document received from a 

remote server, said interpreting means comprising: 

means for recognising speech presentation markup tags in said 

document; 

30 means for storing a set of local library files corresponding to said speech 

presentation markup tags; and 
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means for presenting said speech to a user in accordance with the local 
library files tags found in said multimedia document 

11. A client terminal according to claim 10, wherein said speech is 
5 presented in conjunction with a graphical display comprising a moving facial 

image. 

12. A client terminal according to claim 11, wherein the 
characteristics of said moving facial image are selectable by a user of the client 

10 terminal 

13. A mobile communications terminal adapted to generate 
graphical image data defining a fecial image which is animated to simulate 
actions accompanying speech generated from data received via a radio interface, 

1 5 wherein the characteristics of said fecial image and/or said speech are selectable 
by a user of said client terminal. 
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